An optimization method for the inverted pendulum problem based on deep reinforcement learning
نویسندگان
چکیده
Abstract The inverted pendulum problem is a classical problem. starting at random position keeps moving upwards and aims to reach an upright position. has been solved through some methods based on deep reinforcement learning (DRL) such as Deep Deterministic Policy Gradient (DDPG). However, DDPG also disadvantages. policy not conducive action exploration. Moreover, the Q value needs be estimated reasonably accurately for accurate. Nevertheless, beginning of learning, there certain difference in estimation, parameters learned this time are easy deviate. Therefore, paper combining AdaBound with algorithm proposes optimization method problem, compares performance that four published baselines. experimental results show proposed outperforms above baselines extent.
منابع مشابه
Aggregation-Based Learning in the Inverted Pendulum Problem
We consider the problem of adapting approximate dynamic programming techniques to the inverted pendulum task. This is a particularly challenging task as we work with a relatively uninformative reinforcement signal and have no a priori information about our system. Success in this task requires an effective solution to the credit assignment problem, incorporation of noisy and biased information ...
متن کاملOperation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm
: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...
متن کاملReinforcement Learning with Perturbation Method to Turn Unidirectional Linear Response Fuzzy Controller for Inverted Pendulum
In this paper, we present a unidirectional linear response fuzzy controller (FC) to control the inverted pendulum system. The performance of turning fuzzy controller is defined as an evaluation function and our proposed technique, which is based on the integration of reinforcement learning and a perturbation method, is utilized to diversity the search of minimization of the evaluation function....
متن کاملControl of Inverted Double Pendulum using Reinforcement Learning
In this project, we apply reinforcement learning techniques to control an inverted double pendulum on a cart. We successfully learn a controller for balancing in a simulation environment using Qlearning with a linear function approximator, without any prior knowledge of the system at hand. We do however fail to learn a controller for the swingup maneuver, which leads to a discussion on what mig...
متن کاملFundamental Constraints for the Inverted Pendulum Problem
This paper considers fundamental constraints that exist in the control of an inverted pendulum system. The inverted pendulum system is a single input two output (SITO) system which has an unstable pole. The limitations that are developed use several diierent techniques to show the diiculties in the control of this system. The analysis has applications in the control of SITO systems that have un...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of physics
سال: 2022
ISSN: ['0022-3700', '1747-3721', '0368-3508', '1747-3713']
DOI: https://doi.org/10.1088/1742-6596/2296/1/012008